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INTESTINAL TREFOIL PROTEINS 
RaeVaround 

The field of the invention is peptides useful for 
5 treatment of disorders of the digestive system. 

J0rgensen et al. (Regulatory Peptides 3:231, 1982) 
describe a porcine pancreatic peptide, pancreatic 
spasmolytic peptide (PSP) . PSP was found to inhibit 
"gastrointestinal motility and gastric acid secretion in 
10 laboratory animal after parenteral as well as oral 

administration." It was suggested that "if the' results in 
animal experiments can be confirmed in man, PSP may 
posses a potential utility in treatment of gastroduodenal 
ulcer diseases. 
15 T c!iiTnTnai-v o f. the Invention 

In a first aspect, the invention features a 
purified nucleic acid encoding an intestinal trefoil 

factor (ITF). 

In preferred embodiments, the intestinal trefoil 
20 factor is mammalian intestinal trefoil factor, preferably 

human, rat, bovine, or porcine intestinal trefoil factor. 

In another preferred embodiment, the purified nucleic 

acid encoding an intestinal trefoil factor is present 

within a vector. 
25 m a related aspect, the invention features a cell 

that includes a vector encoding an intestinal trefoil 

factor. 

In another related aspect, the invention features 
a substantially pure intestinal trefoil factor. In a 
30 preferred embodiment, the polypeptide is detectably 

labelled. In a related aspect, the invention features a 
therapeutic composition that includes an intestinal 
trefoil factor and a pharmacologically acceptable 
carrier. 



gNSDOCID: <WO 0214837A1_I_> 



WO 92/14837 



PCT/US92/01200 



- 2 - 

In another aspect, the invention features a 
monoclonal antibody which preferentially binds (i.e., 
forms an immune complex with) an intestinal trefoil 
factor. In a preferred embodiment, the monoclonal 
5 antibody is detectably labelled. 

In a related aspect, the invention features a 
method for detecting human intestinal trefoil factor in a 
human patient. The method includes the steps of 
contacting a biological sample obtained from the patient 

10 with a monoclonal antibody which preferentially binds 

intestinal trefoil factor, and detecting immune complexes 
formed with the monoclonal antibody. In preferred 
embodiments the biological sample is an intestinal 
mucosal scraping, or serum. 

15 In a related aspect, the invention features a 

method for treating digestive disorders in a human 
patient, which method involves administering to the 
patient a therapeutic composition that includes an 
intestinal trefoil factor and a pharmacologically 

20 acceptable carrier. 

In another aspect, the invention features a method 
for detecting binding sites for intestinal trefoil factor 
in a patient. The method involves contacting a 
biological sample obtained from the patient with the 

25 factor, and detecting the factor bound to the biological 
sample as an indication of the presence of the binding 
sites in the sample. By "binding sites", as used herein, 
is meant any antibody or receptor that binds to an 
intestinal trefoil factor protein, factor, or analog. 

30 The detection or quantitation of binding sites may be 
useful in reflecting abnormalities of the 
gastrointestinal tract. 

In another aspect, the invention features 
substantially pure trefoil factor. In preferred 
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embodiments, the intestinal trefoil factor is human, 
porcine, or bovine trefoil factor. 

The term "intestinal trefoil factor" ("ITF") 
includes any protein which is substantially homologous to 
5 rat intestinal trefoil factor (Fig. 2, SEQ ID NO 2) and 
which is expressed in the large intestine, small 
intestine, or colon to a greater extent than it is 
expressed in tissues other than the small intestine, 
large intestine, or colon. Also included are: allelic 
10 variations; natural mutants; induced mutants; proteins 
encoded by DNA that hybridizes under high or low 
stringency conditions to ITF encoding nucleic acids 
retrieved from naturally occurring material; and 
polypeptides or proteins retrieved by antisera to ITF, 
15 especially by antisera to the active site or binding 
domain of ITF. The term also includes other chimeric 
polypeptides that include an ITF. 

The term ITF also includes analogs of naturally 
occurring ITF polypeptides. Analogs can differ from 
20 naturally occurring ITF by amino acid sequence 

differences or by modifications that do not affect 
sequence, or by both. Analogs of the invention will 
generally exhibit at least 70%, more preferably 80%, more 
preferably 90%, and most preferably 95% or even 99%, 
25 homology with all or part of a naturally occurring ITF 
sequence. The length of comparison sequences will 
generally be at least about 8 amino acid residues, 
usually at least 20 amino acid residues, more usually at 
least 24 amino acid residues, typically at least 28 amino 
30 acid residues, and preferably more than 35 amino acid 
residues. Modifications include in vivo, or in vitro 
chemical derivatization of polypeptides, e.g., 
acetylation, or carboxylation. Also included are 
modifications of glycosylation, e.g., those made by 
35 modifying the glycosylation patterns of a polypeptide 
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during its synthesis and processing or in further 
processing steps, e.g., by exposing the polypeptide to 
enzymes that affect glycosylation derived from cells that 
normally provide such processing, e.g., mammalian 
5 glycosylation enzymes. Also embraced are versions of the 
same primary amino acid sequence that have phosphorylated 
amino acid residues, e.g. f phosphotyrosine, 
phosphoserine, or phosphothreonine . Analogs can differ 
from naturally occuring ITF by alterations of their 
10 primary sequence. These include genetic variants, both 
natural and induced. Induced mutants may be derived by 
various techniques, including random mutagenesis of the 
encoding nucleic acids using irradiation or exposure to 
ethanemethylsulf ate (EMS) , or may incorporate changes 
15 produced by site-specific mutagenesis or other techniques 
of molecular biology. See , Sambrook, Fritsch and 
Maniatis (1989), Molecular Cloning; A Laboratory Manual 
(2d ed.), CSH Press, hereby incorporated by reference. 
Also included are analogs that include residues other 
20 than naturally occurring L-amino acids, e.g., D-amino 
acids or non-naturally occurring or synthetic amino 
acids, e.g., or 7 amino acids. 

In addition to substantially full-length 
polypeptides, the term ITF, as used herein, includes 
25 biologically active fragments of the polypeptides. As 
used herein, the term "fragment 11 , as applied to a 
polypeptide, will ordinarily be at least about 10 
contiguous amino acids, typically at least about 20 
contiguous amino acids, more typically at least about 30 
30 contiguous amino acids, usually at least about 40 
contiguous amino acids, preferably at least about 50 
contiguous amino acids, and most preferably at least 
about 60 to 80 or more contiguous amino acids in length. 
Fragments of ITF can be generated by methods known to 
35 those skilled in the art. The ability of a candidate 
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fragment to exhibit a biological activity of ITF can be 
assessed by methods known to those skilled in the art. 
Also included in the term are biologically active ITF 
polypeptides containing amino acids that are normally 
5 removed during protein processing, including additional 
amino acids that are not required for the biological 
activity of the polypeptide , or including additional 
amino acids that result from alternative mRNA splicing or 
alternative protein processing events. 
10 An ITF polypeptide, fragment, or anlaog is 

biologically active if it exhibits a biological activity 
of a naturally occurring ITF, e.g., the ability to alter 
gastrointestinal motility in a mammal. 

The invention also includes nucleic acid 
15 sequences, and purified preparations thereof, that encode 
the ITF polypeptides described herein. The invention 
also includes antibodies, preferably monoclonal 
antibodies, that bind specifically to ITF polypeptides. 
As used herein, the term "substantially pure" 
20 describes a compound, e.g., a nucleic acid, a protein, or 
a polypeptide, e.g., an ITF protein or polypeptide, that 
is substantially free from the components that naturally 
accompany it. Typically, a compound is substantially 
pure when at least 60%, more preferrably at least 75%, 
25 more preferrably at least 90%, and most preferrably at 
least 99%, of the total material (by volume, by wet or 
dry weight, or by mole per cent or mole fraction) in a 
sample is the compound of interest. Purity can be 
measured by any appropriate method, e.g., in the case of 
30 polypeptides by column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. 

By "isolated DNA 11 is meant that the given DNA is 
free of the genes which, in the naturally-occurring 
genome of the organism from which the given DNA of the 
35 invention is derived, flank the given DNA. The term 
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"isolated DNA" thus encompasses, for example, cDNA, 
cloned genomic DNA, and synthetic DNA. A "purified 
nucleic acid", as used herein, refers to a nucleic acid 
sequence that is substantially free of other 
5 macromolecules (e.g., other nucleic acids and proteins) 
with which it naturally occurs within a cell. In 
preferred embodiments, less than 40% (and more preferably 
less than 25%) of the purified nucleic acid preparation 
consists of such other macromolecules. 
10 "Homologous", as used herein, refers to the 

subunit sequence similarity between two polymeric 
molecules, e.g., between two nucleic acid molecules, 
e.g., two DNA molecules, or two polypeptide molecules. 
When a subunit position in both of the two molecules is 
15 occupied by the same monomeric subunit, e.g., if a 
position in each of two DNA molecules is occupied by 
adenine, then they are homologous at that position. The 
homology between two sequences is a direct function of 
the number of matching or homologous positions, e.g., if 
20 half, e.g., 5 of 10, of the positions in two compound 
sequences are homologous then the two sequences are 50% 
homologous, if 90% of the positions, e.g., 9 of 10, are 
matched or homologous the two sequences share 90% 
homology. By way of example, the DNA sequences 
25 S^TTGCC'S and S'TATGGC'S share 50% homology. By 
"substantially homologous" is meant largely but not 
wholly homologous. 

The ITF proteins of the invention are resistent to 
destruction in the digestive tract, and can be used for 
30 treatment of peptic ulcer diseases, inflammatory bowel 
diseases, and for protection of the intestinal tract from 
injury caused by bacterial infection, radiation injury or 
other insults. An ITF protein, fragment, or analog can 
also be used to treat neoplastic cancer. 
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Other features and advantages of the invention 
will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

Detailed Description 
5 The drawings will first be briefly described. 

Drawings 

Figure 1 is a depiction of the nucleotide sequence 
of rat trefoil f actor (SEQ ID NO: 1). 

Figure 2 is a depiction of the deduced amino acid 
10 sequence of rat trefoil factor (SEQ ID NO: 2). 

Figure 3 is a depiction of the amino acid 
sequences of rat trefoil factor, pS2 protein , and 
pancreatic spasmolytic polypeptide. The sequences are 
aligned so as to illustrate the amino acid sequence 
15 homology between the proteins. Dashes (-) indicate the 
insertion of spaces which improve alignment. Bars ( ) 
indicate sequence identities. 

Figure 4 depicts the disulfide bond structure 
proposed for pS2 (panel A) and PSP (panel B) ; 
20 Figure 5 is a depiction of the proposed disulfide 

bond structure of rat intestinal trefoil factor. 

Figure 6 is a depiction of the nucleotide sequence 
of the human intestinal trefoil factor cDNA and the 
corresponding deduced amino acid sequence (SEQ ID NO: 3). 
25 Purificatio n and cloning of rITF 

An inhibitor of soft agar colony formation by 
human breast carcinoma-derived BT-20 cells (ATTC HTB79) 
was isolated from cytology-positive human malignant 
effusions (Podolsky et al., Cancer Res. 48:418, 1988; 
3 0 hereby incorporated by reference). The factor also 
inhibited soft agar colony formation by human colon 
carcinoma-derived HCT15 cells ( ATTC-CCL2 25) . Inhibition 
was not observed for polyoma and murine sarcoma virus 
transformed rodent fibroblast lines. The isolated factor 
35 (transformed cell-growth inhibiting factor or TGIF) had 
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an apparent molecular weight of 110,000 kD and appeared 
to consist of two 55,000 kD subunits linked by sulfhydryl 
bonds. 

The purified protein was partially sequenced.. The 
5 sequence from the amino terminal 14 amino acids was used 
to produce a set of degenerate oligonucleotide probes for 
screening of a rat intestinal epithelial cell cDNA 
library. 

A rat intestinal cDNA library (Lambda ZAP° II, 

10 Stratagene, La Jolla, CA) was produced by standard 

techniques (Ausubel et al., eds., Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1989) 
using cells purified by the method of Weisner (J. Biol 
Chem. 248:2536, 1973). Screening of the cDNA library with 

15 the fully degenerate oligonucleotide probe described 

above resulted in the selection of 21 clones. One of the 
clones (T3411) included a core sequence which encoded a 
single open reading frame. The nucleotide sequence of the 
open reading frame and flanking DNA is presented in Fig. 

20 1 (SEQ ID NO 1) . The insert present in T3411 was nick 
translated (Ausubel et al., supra) to produce a 
radioactively labelled probe for Northern blot analysis 
of rat poly (A) + RNA. Northern analysis demonstrated that 
RNA corresponding to the cloned cDNA fragment was 

25 expressed in small intestine, large intestine, and 

kidney; no expression was detected in the lung, spleen, 
heart, testes, muscle, stomach, pancreas, or liver. In 
the tissues in which the RNA was expressed, the level was 
comparable to that of actin. 

30 The open reading frame of clone T3411 encoded an 

81 amino acid peptide (Fig. 2; SEQ ID NO 2). Comparison 
of the sequence of the encoded peptide, referred to as 
rat intestinal trefoil factor (rITF) , to the sequence of 
proteins in the Genebank database revealed significant 

35 homology to human breast cancer associated peptide (pS2; 
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Jakowlev et al., Nucleic Acids Res. 12;2861, 1984) and 
porcine pancreatic spasmolytic peptide (PSP; Thim et al., 
Biochem. Biophys. Acta 827:410, 1985). Fig. 3 illustrates 
the homology between rITF, PSP and pS2. Porcine 
5 pancreatic spasmolytic factor (PSP) and pS2 are both 
thought to fold into a characteristic structure referred 
to as a trefoil. A trefoil structure consists of three 
loops formed by three disulfide bonds. pS2 is thought to 
include one trefoil (Fig. 4A) , and PSP is thought to 

10 include two trefoils (Fig. 4B) . The region of rITF 

(nucleotide 114 to nucleotide 230 which encodes cys to 
phe) which is most similar to PSP and pS2 includes six 
cysteines all of which are in the same position as the 
cysteines which make up the trefoil in pS2 (Fig. 3). Five 

15 of these six cysteines are in the same position as the 
cysteines which form the amino terminal trefoil of PSP 
(Fig. 3). Fig. 5 depicts the proposed disulfide bond 
configuration of rITF. 

Based on homology to PSP and pS2 (Mori et al., 

20 Biochem. Biophys. Res. Comm. 155:366, 1988; Jakowlew et 
al., Nucleic Acids Res. 12:2861, 1984), rITF includes a 
presumptive pro- sequence (met 1 to ala 22 ) in which 12 of 
22 amino acids have hydrophobic side chains. 
Production of Anti-rlTF Antibodies 

25 A peptide corresponding to the carboxy-terminal 21 

amino acids of rITF was synthesized and coupled to bovine 
serum albumin (BSA) . This conjugate (and the unconjugated 
peptide) was used to raise polyclonal antibodies in 
rabbits. All procedures were standard protocols such as 

30 those described in Ausubel et al. (supra). The anti-rlTF 
antibodies were used in an indirect immunof louresce assay 
for visualization of rITF in rat tissues. Cryosections of 
rat tissues were prepared using standard techniques, and 
fluorescein labelled goat anti-rabbit monoclonal antibody 

35 (labelled antibodies are available from such suppliers 
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Kirkegaard and Perry Laboratories, Gaithersberg, MD; and 
Bioproducts for Science, In., Indianapolis, IN) was used 
to detect binding of rabbit anti-rlTF antibodies. By this 
analysis rITF appears to be present in the globlet cells 
5 of the small intestine but not in the stomach or the 
pancreas . 

Cloning of Human Intestinal Trefoil Factor 

DNA encoding the rat intestinal trefoil factor can 
be used to identify a cDNA clone encoding the human 

10 intestinal trefoil factor (hITF) . This can be 

accomplished by screening a human colon cDNA library with 
a probe derived from rITF or with a probe derived from 
part of the hITF gene. The latter probe can be obtained 
from a human colon or intestinal cDNA using the 

15 polymerase chain reaction to isolate a part of the hITF 
gene. This probe can then serve as a specific probe for 
the identification of clones encoding all of the hITF 
gene. 

Construct i on of a cDNA Library . 
20 A human colon or intestinal cDNA library in AgtlO 

or Agtll, or some other suitable vector is useful for 
isolation of hITF. Such libraries may be purchased 
(Clontech Laboratories, Palo Alto, CA: HLl034a, 
HLI0346b) . Alternatively, a library can be produced using 
25 mucosal scrapings from human colon or intestine. Briefly, 
total RNA is isolated from the tissue essentially as 
described by Chirgwin et al. (Biochemistry 18:5294, 1979; 
see also Ausubel et al., supra). An oligo (dT) column is 
then used to isolate poly (A) + RNA by the method of Aviv 
30 et al. (J. Mol. Biol. 134:743, 1972; see also Ausubel et 
al., supra;. Double-stranded cDNA is then produced by 
reverse transcription using oligo (dT) 12 _ 18 or random 
hexamer primers (or both) . RNAse H and E. coli DNA poll 
are then used to replace the RNA strand with a second DNA 
35 strand. In a subsequent step E. coli DNA ligase and T4 
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DNA polymerase are used to close gaps in the second DNA 
strand and create blunt ends. Generally, the cDNA created 
is next methylated with EcoRI methylase and EcoRI linkers 
are added (other linkers can be used depending on the 
5 vector to be used) . In subsequent steps the excess 

linkers are removed by restriction digestion and the cDNA 
fragments are inserted into the desired vector. See 
Ausubel et al. # supra and Sambrook et al. (Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor 

10 Laboratory, Cold Spring Harbor, NY, 1990) for detailed 
protocols. Useful vectors include: Agtll, AgtlO, Lambda 
ZAP® II vector, Lambda Uni-ZAP W XR vector, all available 
from Stratagene (La Jolla, CA) . 

The cDNA library must be packaged into phage; this 

15 is most readily accomplished by use of a commercial in 
vitro packaging kit, e.g., Gigapack* II Gold or Gigapack® 
II Plus (Stratagene, La Jolla, CA) . See Ausubel et al. 
(supra) for packaging protocols and suitable host 
strains. The library is preferably amplified soon after 

20 packaging; this step generates sufficient clones for 
multiple screening of the library • See Ausubel et al. 
supra or Sambrook et al. supra for details of 
amplification protocols and procedures for storing the 
amplified library. 

25 screening o f the cDNA Library. To screen the library it 
must be placed on an appropriate host strain (e.g., Y1090 
or Y1088 for AgtlO libraries, C600hflA for AgtlO 
libraries) . After plating the phage, plaques are 
transferred to nitrocellulose or nylon filters (See 

30 Ausubel et al., supra and Sambrook et al. supra). The 

filters are then probed with a 32 P-labelled nick translated 
probe derived from rITF. The probe is preferentially 
generated using a portion of the region of rITF DNA 
coding for the trefoil structure (nucleotides 114 to 230 

35 of SEQ ID NO. 1 which encode cyS32 to phe 71 of SEQ ID NO. 
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2) . This region is conserved between rITF, pS2 and PSP, 
and it is likely that this region is conserved between 
rITF and hITF. Once a plague is identified several cycles 
of plaque purification are required to isolate a pure 
5 clone encoding hITF. A phage DNA isolation is performed 
and the cDNA insert can be subcloned into an appropriate 
vector for restriction mapping and sequencing. If the 
phage vector is Lambda ZAP* II , coinfection with helper 
phage allows rescue and recircularization of pBluescript 
10 SYT phagemid vector (Stratagene, La Jolla, CA) harboring 
the cDNA; alternatively the phage clone is purified and 
the cDNA insert is subcloned into a vector suitable for 
restriction mapping and sequencing. If the clone does 
not contain the entire hITF gene (as assessed by homology 
15 to rITF and the presence of start and stop codons) , the 
library can be rescreened with the original rITF probe 
or, preferably, with a probe generated from the hITF 
clone obtained. If none of the clones contain the intact 
gene, it can be reconstructed from clones which bear 
20 overlapping fragments of hITF. 

Direct Isolation of an hITF Probe bv PCR 

It is possible to isolate part of the hITF gene 
directly from the packaged library or cDNA. To isolate a 
portion of hITF directly from the packaged library, a 
25 pair of oligonucleotide primers and Tag polymerase are 
used to amplify the DNA corresponding to the hITF gene. 
The primers used would be approximately 15-20 nucleotides 
long and correspond in sequence to the 5' -most and 3*- 
most portions of the rITF coding sequence. Friedman et 
30 al. (in PCR Protocols: A Guide to Methods and 

Applications, Innis et al., eds., Academic Press, San 
Diego) describe a procedure for such amplification. 
Briefly, phage particles are disrupted by heating; Taq 
polymerase, primers (300 pmol of each), dNTPs, and Taq 
35 polymerase buffer are added; and the mixture is thermally 
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cycled to amplify DNA. The amplified DNA is isolated by 
agarose gel electrophoresis. The ends of the fragment 
are prepared for ligation into an appropriate vector by 
making them flush with T4 polymerase and, if desired, 
5 adding linkers. Alternatively, a restriction site may be 
engineered into the fragment ,by using primers which have 
sequence added to their 5* ends which sequence will 
generate an appropriate sticky end when digested. For 
example the sequence: 5 * -GGGCGGCCGC3 1 can be added to the 
10 5« end of each primer. This sequence includes the NotI 
restriction site flanked at the 5» end by the sequence: 
GG. The additional nucleotides prevent the 5' ends from 
denaturing and interfering with subsequent restriction 
digestion with NotI. The gel purified DNA of the 
15 appropriate size is next cloned into a cloning vector for 
sequencing and restriction mapping. This clone will not 
have the entire hITF sequence, rather it will be a 
combination of hITF (the region between the sequences 
corresponding to the primers) and rITF (the 5' and 3' 
20 ends which correspond to the primer sequences). However, 
this DNA can be used to generate a labelled probe 
(produced by nick translation or random primer labelling) 
which, since it is the correct hITF sequence, can be used 
in a high stringency screening of the library from which 
25 the cDNA was originally isolated. In an alternative 

approach, cDNA can be used in the above procedure instead 
of a packaged library. This eliminates the steps of 
modifying the cDNA for insertion into a vector as well as 
cDNA packaging and library amplification. Ausubel et al. 
30 supra provides a protocol for amplification of a 

particular DNA fragment directly from cDNA and a protocol 

for amplification from poly (A) + RNA. 

j^p-t^ fi cation of a Presu mptive Human ITF clone 

A nick translated probe derived from rITF cDNA 
35 (corresponding to nucleotides 1 to 431 of SEQ ID No. 1) 
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was used for Northern blot analysis of poly (A) + rna 
derived from human intestinal mucosal scrapings. Probe 
hybridization and blot washing were carried out according 
to standard procedures. Probe (5 x 10 5 cpm/ml 
hybridization buffer) was hybridized to the filter at 
45 °C in 5X SSC with 30% formamide. The filter was then 
washed at 60»C in 5X SSC with 40% formamide. Using this 
protocol a band was clearly visible after an overnight 
exposure of the filter with an intensifying screen. This 
result indicated that there is sufficient homology 
between rITF and hITF to allow the use of probes derived 
from the sequence of the rITF gene for identification of 
the hITF gene. 

A human intestinal cDNA library was obtained from 
15 clontech (Palo Alto, CA) . Alternatively, a human 
intestinal cDNA library may be produced from mucosal 
scrapings as described above. Four oligonucleotide 
probes were selected for screening the library cDNA. Two 
of the probes correspond to sequences within the region 
20 of rITF encoding the trefoil and are referred to as 
internal probes (5«gtacattctgtctcttgcaga-3 • and 5'- 
taaccctgctgctgctggtcctgg3'). The other two probes 
recognize sequences within rITF but outside of the 
trefoil encoding region and are referred to as external 
25 probes (5'-gtttgcgtgctgccatggaga-3 ' and 

5 • -ccgcaattagaacagccttgt-3 • ) . These probes were tested 
for their utility by using them to screen the rat 
intestinal cDNA library described above. Each of the 
four probes could be used to identify a clone harboring 
30 all or part of the rITF gene. This result indicates that 
these probes may be used to screen the human intestinal 
library for the presence of hITF. 

The internal probes were used as described above 
to amplify a DNA fragment from human colon library cDNA 
35 (Clontech, Palo Alto, CA) . Linkers were added to the 
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isolated DNA fragment which was then inserted into 
pBluescript phagemid vector (Stratagene, La Jolla, CA) . 
The region of this clone corresponding to the sequence of 
human cDNA (i.e., not including the sequence 
5 corresponding to the internal probes) was used to make a 
radioactively labelled probe by random oligonucleotide- 
primed synthesis (Ausebel et al. , supra) . This probe was 
then used to screen the human colon cDNA library. This 
screening led to the identification of 29 clones. One of 
10 these clones (HuPCR-ITF) was nick-translated to generate 
a probe for Northern analysis of poly (A) + RNA isolated 
from human intestinal mucosal scrapings. A single band 
of roughly the 6ame size as the rat transcript 
(approximately 0.45 kD) was observed. 
15 Northern analysis of poly (A) + isolated from human 

tissues indicated that RNA corresponding to this probe 
was expressed in the small intestine and the large 
intestine but not ' in the stomach or the liver. These 
results indicate that the clone does not encode the human 
20 homolog of porcine PSP. Porcine PSP is expressed in 
porcine pancreas and is not significantly expressed in 
the small or large intestine. These results also 
distinguish the cloned gene from pS2 which is expressed 
in the stomach. 
25 Figure 6 shows the nucleic acid sequence 

information for human ITF cDNA, along with the deduced 
amino acid sequence in one-letter code (SEQ ID NO: 3). 
This clone was obtained by the methods described above, 
production of hITF 
30 The isolated hITF gene can be cloned into a 

mammalian expression vector for protein expression. 
Appropriate vectors include pMAMneo (Clontech, Palo Alto, 
CA) which provides a RSV-LTR enhancer linked to a 
dexamethasone-inducible MMTV-LTR promoter, an SV40 origin 
35 of replication (allows replication in COS cells) , a 
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neomycin gene, and SV40 splicing and polyadenylation 
sites. This vector can be used to express the protein in 
COS cells, CHO cells, or mouse fibroblasts. The gene may 
also be cloned into a vector for expression in drosophila 
5 cells using the bacoluvirus expression system. 
Purification of Intesti nal Trefpjl Factor 

Intestinal trefoil factor can be purified from 
intestinal mucosal scrapings of human, rats or any other 
species which expresses ITF (pigs and cows may provide a 
10 source of ITF) • The purification procedure used for PSP 
will be useful for the purification of ITF since the 
proteins are likely to be homologous. Jorgensen et al. 
describes a method for purification of PSP (Regulatory 
Peptides 3:207, 1982). The preferred method is the 
15 second approach described by Jorgensen et al. (supra) . 
This method involves chromatography of SP-Sephadex C-25 
and QAE Sephadex A-25 columns (Sigma, St. Louis, MO) in 
acidic buffer. Anti-Intestinal Trefo il Fantnr Monoclone 
Antibodies 

20 Anti-intestinal trefoil factor monoclonal 

antibodies can be raised against synthetic peptides whose 
sequences are based on the deduced amino acid sequence of 
cloned hITF (SEQ ID NO: 3). Most commonly the peptide is 
based on the amino-or carboxy-terminal 10-20 amino acids 

25 of the protein of interest (here hITF) . The peptide is 
usually chemically cross-linked to a carrier molecule 
such as bovine serum albumin or keyhole limpet 
hemocyanin. The peptide is selected with the goal of 
generating antibodies which will cross-react with the 
30 native hITF. Accordingly, the peptide should correspond 
to an antigenic region of the peptide of interest. This 
is accomplished by choosing a region of the protein which 
is (1) surface exposed, e.g., a hydrophobic region or (2) 
relatively flexible, e.g., a loop region or a 0-turn 
35 region. In any case, if the peptide is to be coupled to 
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a carrier, it must have an amino acid with a side chain 
capable of participating in the coupling reaction. See 
Hopp et al. (Mol. Immunol. 20:483, 1983; J. Mol. Biol. 
157:105, 1982) for a discussion of the issues involved in 
5 the selection of antigenic peptides. A second 

consideration is the presence of a protein homologous to 
hITF in the animal to be immunized. If such a protein 
exists, it is important to select a region of hITF which 
is not highly homologous to that homolog. 

10 for hITF, peptides that correspond to the amino- 

terminal or car boxy-terminal 15 amino acids are likely to 
be less homologous across species and exposed to the 
surface (and thus antigenic) . Thus they are preferred for 
the production of monoclonal antibodies. Purified hITF 

15 can also be used for the generation of antibodies. 
Use 

In the practice of the present invention ITF may 
be administered orally, intravenously, or 
intraperitoneal^ for treatment of peptic ulcer diseases, 

20 inflammatory bowel diseases, and for protection of the 
intestinal tract from injury caused by bacterial 
infection, radiation injury or other insults. The mode 
of administration, dosage, and formulation of ITF depends 
upon the condition being treated. 

25 other Embodiments 

Other embodiments are within the following claims. 
For example, ITF may be used to produce monoclonal 
antibodies for the detection of ITF in intestinal tissue 
or blood serum by means of an indirect immunoassay. ITF 

30 may be detectably labelled and used in an in situ 
hybridization assay for the detection of ITF binding 
sites. Labels may include, but are not limited to, 
florescein or a radioactive ligand. 

ITF may be used to protect and stabilize other 

35 proteins. This protection is accomplished by forming a 
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hybrid molecule in which all or part of ITP is fused to 
either the carboxy-terminus or the amino-terminus (or 
both) of the protein of interest. Because ITF is 
resistant to degradation in the digestive system, it will 
5 protect the protein of interest from such degradation. 
As a consequence, the protein of interest is likely to 
remain active in the digestive system and/or will be more 
readily absorbed in an intact form. 
What is claimed is: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEN GTH: 43i 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAAGTTTGCG TGCTGCC 



ATG GAG ACC AGA GCC TTC TGG ATA ACC CTG CTG CTG GTC CTG GTT GCT 
Met Glu Thr Arg Ala Phe Trp He Thr Leu Leu Leu Val Leu Val Ala 
1 5 in 



15 



GGG TCC TCC TGC AAA GCC GAG GAA TTT GTT GGC CTA TCT CCA AGC CAA 
Gly Ser Ser Cys Lys Ala Gin Glu Phe Val Gly Leu ser Pro Ser Gin 
20 25 30 

TGT ATG GCG CCA ACA AAT GTC AGG GTG GAC TGT AAC TAC CCC ACT GTC 
Cys Met Ala Pro Thr Asn Val Arg Val Asp Cye Asn Tyr Pro Thr Val 
35 40 45 

ACA TCA GAG CAG TGT AAC AAC CGT GGT TGC TGT TTT GAC TCC AGC ATC 
Thr Ser Glu Gin Cys Aen Asn Arg Gly Cys Cys Phe Aep Ser Ser He 
50 55 60 

CCA AAT GTG CCC TGG TGC TTC AAA CCT CTG CAA GAG ACA GAA TGT ACA 
Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin Glu Thr Glu Cye Thr 
65 70 75 80 

TTT 
Phe 

TGAAGCTGTC CAGGCTCCAG GAAGGGAGCT CCACACCCTG GACTCTTGCT GATGGTAGTG 
GCCCAGGGTA ACACTCACCC CTGATCTGCT CCCTCGCGCC GGCCAATATA GGAGCTGGGA 
GTCCAGAAGA ATAAAGACCT TACAGTCAGC ACAAGGCTGT TCTAATTGCG G 



17 
65 

113 

161 

209 

257 

260 

320 
380 
431 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 

(B) TYPE: amino acid 

(C) STRAND EDNESS : N/A 

(D) TOPOLOGY: N/A 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Thr Arg Ala Phe Trp lie Thr Leu Leu Leu Val Leu Val Ala 
5 10 15 

Gly Ser Ser Cys Lye Ala Gin Glu Phe Val Gly Leu Ser Pro Ser Gin 
20 25 30 

Cys Met Ala Pro Thr Asn Val Arg Val Ab P Cys Asn Tyr Pro Thr Val 
35 40 45 

Thr Ser Glu Gin Cys Asn Asn Arg Gly Cys Cys Phe Asp Ser Ser He 
50 55 50 

Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin Glu Thr Glu Cys Thr 
65 70 75 80 

Phe 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 4 03 

(B) TYPE i nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



1 



ATG CTG GGG CTG GTC CTG GCC TTG CTG TCC TCC AGC TCT GCT GAG GAG 49 
Met Leu Gly Leu Val Leu Ala Leu Leu Ser Ser Ser Ser Ala Glu Glu 
! 5 10 IS 

TAC GTG GGC CTG TCT CCA AAC CAG TGT GCC GTG CCG GCC AAG GAC AGG 97 
Tyr Val Gly Leu Ser Ala Asn Gin Cys Ala Val Pro Ala Lys Asp Arg 
20 25 30 

GTG GAC TGC GGC TAC CCC CAT GTC ACC CCC AAG GAG TGC AAC AAC CGG 145 
Val Asp Cys Gly Tyr Pro His Val Thr Pro Lys Glu Cys Asn ABn Arg 
35 " 40 45 

GGC TGC TGC TTT GAC TCC AGG ATC CCT GGA GTG CCT TGG TGT TTC AAG 193 
Gly Cye Cys Phe Asp Ser Arg lie Pro Gly Val Pro Trp Cys Phe Lys 
50 55 60 
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CCC CTG ACT AGG AAG ACA GAA TGC ACC TTC 
Pro Leu Thr Arg Lys Thr Glu Cye Thr Phe 
65 70 

TGAGGCACCT CCAGCTGCCC CTGGGATGCA GGCTGAGCAC CCTTGCCCGG CTGTGATTGC 
TGCCAGGCAC TGTTCATCTC AGTTTTTCTG TCCCTTTGCT CCCGGCAAGC TTTCTGCTGA 
AAGTTCATAT CTGGAGCCTG ATGTCTTAAC GAATAAAGGT CCCATGCTCC ACCCGAAAAA 



223 

283 
343 
403 
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Claims 

1 1. A purified nucleic acid encoding an intestinal 

2 trefoil factor. 

1 2. The purified nucleic acid of claim 1 wherein said 

2 intestinal trefoil factor is mammalian intestinal trefoil factor, 

1 3. The purified nucleic acid of claim 2 wherein said 

2 mammal is a human. 

1 4. The purified nucleic acid of claim 2 wherein said 

2 mammal is a rat, 

1 5. The purified nuqleic acid of claim 2 wherein said 

2 mammal comprises a cow, or a pig. 

1 6. The purified nucleic acid of claim 1, said purified 

2 nucleic acid being present within a vector. 

1 7. A cell comprising a vector encoding an intestinal 

2 trefoil factor. 

1 8. A substantially pure intestinal trefoil factor. 

1 9. A therapeutic composition comprising said factor of 

2 claim 8 and a pharmacologically acceptable carrier. 

1 10. A monoclonal antibody which preferentially binds the 

2 factor of claim 8. 

1 11. The monoclonal antibody of claim 10 wherein said 

2 monoclonal antibody is detectably labelled. 



BNSDOCID: <WO 9214837A1J_> 



W0 92/14S37 PCT/US92/01200 



- 24 - 

1 12. A method for detecting intestinal trefoil factor in 

2 a human patient comprising 

3 contacting a biological sample obtained from said patient 

4 with said! monoclonal antibody of claim 10 , and 

5 detecting immune complexes formed with said monoclonal 

6 antibody, 

1 13. The method of claim 12 wherein said biological 

2 sample is an intestinal mucosal scraping. 

1 14. The method of claim 12 wherein said biological 

2 sample is serum. 

1 15. A method for treating digestive disorders in a human 

2 patient, comprising 

3 administering to said patient the therapeutic 

4 composition of claim 9. 

1 16. The factor of claim 8 wherein said factor is 

2 detectably labelled. 

1 17. A method for detecting binding sites for intestinal 

2 trefoil factor in a patient comprising 

3 contacting a biological sample obtained from said patient 

4 with said factor of claim 8, and 

5 detecting said factor bound to said biological sample as 

6 an indication of the presence of said binding sites in said 

7 sample. 

1 18. An isolated DNA comprising a sequence encoding an 

2 intestinal trefoil factor. 

1 19. The isolated DNA of claim 18, wherein said DNA is 



2 present within a vector. 
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1 20. The isolated DNA of claim 18, wherein said 

2 intestinal trefoil factor is a mammalian intestinal trefoil 

3 factor. 

1 21. The isolated DNA of claim 20, wherein said mammal is 

i 

2 a human. 

1 22. The isolated DNA of claim 20, wherein said mammal is 

2 a rat. 

t 

1 23. The isolated DNA of claim 20, wherein said mammal 

2 comprises a cow, or a pig. 
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gatgctggggctggtcctggccttgctgtcctccagctctgctgaggagtacgtgggcct 
+ 4 + + + + 

1 60 
MLGLVLALLSSSSAEEYVGL 

gtctgcaaaccagtgtgccgtgccggccaaggacagggtggactgcggctacccccatgt 

+ + + + 4 + 

61 12° 

SAHQCAVPAKDRV.DCCYPHV 



cacccccaaggagtgcaacaaccggggctgctgctttgactccaggatccctggagtgcc 

1 — + 4 + 4 —> 

21 181 
TPKECNNRGCCFDSRIPGVP 



ttggtgtttcaagcccctgactaggaagacagaatgcaccttctgaggcacctccagctg 

+ 4 4 4 + ■» 

WCFKPLTRKTRCTF* 



cccctgggatgcaggctgagcacccttgcccggctgtgattgctgccaggcactgttcat 

+ 4 + + + + 

241 30 
ctcagtttttctgtccctttgctcccggcaagctttctgctgaaagttcatatctggagc 

4— 4 + 

301 

ctgatgtcttaacgaataaaggtcccatgctccacccgAAAAA 

+ 4 4 4 03 

361 



4 4 4 

360 
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